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Abstract. Terms considered as trees are very useful tools for presenta- 
tion and manipulation of different data structures in Computer Science 
and Information Technology. Any work of the computer over the trees, 
for instance as graphical user interface (menus of the different windows) , 
XML - documents, definitions and description of hierarchial objects in 
C++ or Java programming etc. can be considered as runs of an au- 
tomaton (the given computer) over these trees. Often the automaton 
loses much time to work over a tree spending it in some useless checking 
of nonessential information. Usually it means that some inputs or some 
subtrees of the given tree are Active for this automaton. The studying 
of Active and essential subtrees of a tree with respect to an automaton is 
main aim of the article. We introduce essential subtrees for terms (trees) 
and tree automata . There are some results concerning independent sets 
of subtrees and separable sets for a tree and an automaton. 
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1. Introduction 

Recall that a tree automaton A is a tuple A = (Q,J-, A, Qf) where: Q is 
set of states, Qf - set of final states, A - set of transition rules and J- is set 
of operation symbols. 

Tree automata work (run) over trees (terms), composed with operation sym- 
bols from T and variables from X = {xi,x%, . . .}. An automaton assigns to 
the vertices of tree some states according to the transition rules from A. A 
tree is recognized by an automaton if it assigns to the root of the tree some 
final state from Qf. In section 2 we give the basic definitions and facts from 
universal algebra (see [H E] ) and trees (see [H [5] ) . In section 3 the con- 
cept of essential inputs and essential subtrees for a term and an automaton 
is introduced. An subtree U of t is essential for the tree t and an automaton 
A iff there are two runs of A over t whose initial states are different on 
the inputs belonging to t{ only, and the resulting states of these runs are 
different. A generalization of essential subtrees is the concept of separable 
sets of subtrees. 



2 



SLAVCHO SHTRAKOV 



2. Trees and Independent Sets 

Let T be any finite set, the elements of which are called operation symbols. 
Let r : T — > iV be a mapping into the non negative integers; for / G f, 
the number r(/) will denote the oriiy of the operation symbol /. The pair 
(J 7 , t) is called type or signature. If it is obvious what the set J 7 is, we will 
write "type r". The set of symbols of arity p is denoted by T v . Elements 
of arity 0,1, ... ,p respectively are called constants (nullary ) , unary,..., p-ary 
symbols. We assume that JF ^ 0. 

Definition 1. Let X n = {x±, . . . ,x n },n > 1, be a set of variables with 
X n n T = 0. The set W T (X n ) of n—ary terms of type r with variables from 
X n is defined as the smallest set for which: 
(i) C W T (X n ) and 
(«*) A„ C W T (A n ) and 

(tit) if p > 1,/ G ^> and t\, . . . ,t p G W" T (X n ) then /(*!, ... ,t p ) G W T (X n ). 
By W T (X) we denote the following set 

W T {X) := U™ =1 W T (X n ), 

where X = {x±,X2, ■ ■ ■}■ 

Let t be a term. By Var(t) the set of all variables from X which occur in t 
is denoted. The elements of Var{t) are called input variables (inputs) for t. 
If t G W T (X) and s x G W T (X) then the term denoted by t(x <— s x ), is 
obtained by substituting in t, simultaneously for every x G X, s x . 
H t,s x G W r (X n ), one may then write t(x ■<— s x ) in the more explicit form 

t{x\ < S Xl , ■ ■ ■ , X n < Sx n ) • 

Any subset L of W T (X) is called term-language or tree- language. 
If X = then W T (X) consists of so called ground terms, which can be viewed 
as usual terms after replacing in them the all variables with some nullary 
operation symbols. 

Let t be a term of type r. The depth of t is defined inductively: if £ G XUJ-q 
then Depth(t) = 0; and if t = f(t±, . . . ,t n ) then 
Depth(t) = max{Depth(ti), Depth(t n )} + 1. 

Let -/V be the set of natural numbers and N* be the set of finite strings over 
N. The set N* is naturally ordered by n < fn <^=^ n is a prefix of m. The 
empty string in N* we will denote by e. 

A term t G W T (X) can also be defined as a partial function t : N* — > T U X 
with domain Pos(t) C -/V* satisfying the following properties: 
(i) e G Pos(t) and Pos(t) is prefix-closed; 

(m) For each p G Pos(t), if G .Fn, n > 1 then G Pos(t)} = 

{1, . . . ,n}; 

(ra) For each p G Pos(t), if i(p) £lUf then {i\pi G Pos(i)} = 0. 
The elements of Pos(t) are called positions. 

A subterm (subtree) t\p of a term i G W T (X) at position p is the tree with 

Pos(%) = {q G Poa(i) | p -< g}. 

The set of all subtrees of t is denoted by Sub(t). 
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We denote by < the subterm ordering, i.e. we write t <i' if there is a position 
p G Pos(t') such that t = t'\ p and one says that t is a subterm of t' . We 
write t < t' if t < t' and t + t'. 
Clearly, t\ p < t|g if and only if g -< p. 

A chain of subterms Ch := t Pl < t P2 <d . . . < t p , is called strong if for all 
j G {1, . . . , k — 1} there does not exist any term s such that i Pj < s < 

Lemma 1. A chain t|pj- < . . . < i|pj: is strong if and only if p i+1 -< pi and 
there are integers ki G N such that pTfl = plki for each i,i £ {1, . . . , k — 1}. 

Definition 2. Let Pi,p2 ^ Pos{t). The positions ^ and p 2 are called 
independent if Pos(t |pj-) D Pos(t\p^) = 0. 

If ti,t2 ^ Sub(t) are the subtrees corresponding to the positions pT, p2 G 
Pos(t) i.e. ti = and i2 = ^Ipj which are independent then the subtrees 
t± and ^2 are called independent, too. 

The set Ind(pi) consists of all positions q £ Pos(t) which are independent 
on p\. Analogously, the set Ind(t\) consists of all subtrees s € Sub{t) which 
are independent on t\. 

Definition 3. A set of strings over the set of natural numbers P is called: 
(£) prefix closed iff p G P and q -<p imply q £ P; 

(ii) prefix determined w.r.t. the set Q, Q C N* iff p G P, p -<q and q G Q 
imply q G P. 

Lemma 2. For each p, p G Pos(t) the set Ind(p) is prefix determined w.r.t. 
Pos(t). 

Proof. Let p G Pos(t) and q G Ind(p). Hence Pos(i|p) n Pos(t| ? ) = 0. 
Clearly if q -< r and r G Pos(t) then r G Pos(t|q). From Pos(f|r) c Pos(i|g) 
it follows that Pos(t| F ) n Pos(i| p ) = i.e. r G Ind(p). ■ 

Example 1. Let us illustrate all these notion by the following tree: 

* = /i(5(/i(^i 5 ^2)),/2(5(/i(^3,/i(^4,^3))),ff(/i(2;2,2;i)))) drown on Figure 

m 

1. The set of positions of this tree is Pos(t) := 

:= {0, T, 2, IT, ITT, TT2, 21, 2TT, mT, 2TT2, 2TT2T, 2TT22, 22, 22T, 22TT, 22T2} . 

2. The set of independent positions on the position 111 is: 

Jnd(TTI) = {2, 21, 2TT, 2TT1 , 2TT2 , 2TT2T, 2TT22 , 22, 22T, 22TT, 22l2 } . 

3. The set of independent subtrees on the tree is: 

ind(t\z) = {t|T,t|ir.t|nr>*lii2}- 
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Figure 1. 

3. Finite Tree Automata and Essential Subtrees 

Definition 4. A finite tree automaton is a tuple A = {Q,^, Qf, A) where: 

- Q is a finite set of states; 

- Qf Q Q is a set of final states; 

- A is a set of transition rules i.e. if 

f = f UfiU...Uf n then A = {A ,Ai,...,A n }, 

where Aj are mappings Ao : Fq — > Q, and Aj : .Fj x Q l — > Q, for « = 1, . . . , n. 

We will suppose that A is complete i.e. the A's are total functions on their 
domains. 

Let Y C X be a set of input variables. Ass(Y,J r o) denotes the set of all 
functions (evaluations) 7 : Y — > JFq. 

Let t G W T (X), 7 G As^y,.^) and Y = {xi,...,x m }. By 7(t) the term 
7(t) = t(xi <— 7(^1), . . . , x m <— 7(x m )) will be denoted. 
So, each assignment 7 G ^4ss(y, JT ) can be extended to a mapping defined 
on the set W T (X) of all terms. 

Let 7 G ylss(A, ^o)- The automaton A = (Q,J^, Qf, A) runs over t and 
7. It starts at leaves of t and moves downwards, associating along a run a 
resulting state with each subterm inductively: 

If t = Xi G X then the automaton associates with t the state q G Q, with 
g = A (7(xj)). 

If t = /o G ^0 then the automaton associates with t the state q = A (/o). 
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If t = f(ti, . . . , t n ) and the states q\, . . . , q n are associated with the sub- 
terms (subtrees) t±, . . . , t n then with t the automaton A associates the state 
q, with q = A n (f,q 1 ,.. .,q n ). 

The automaton runs over ground terms and each assignment from Ass(X, Fq) 
transforms any tree as a ground term. 

A term t ,t G W T (X) is accepted by a tree automaton A = (Q,F, Qf, A) 
if there exists an assignment 7 such that when running over t and 7 the 
automaton A associates with t a final state q £ Qf. 

When A associates the state q with a subterm s, we will write .4(7, s) = q. 
If 7 € Ass(Y,f r o), Y C X then .4.(7, i) is resulting tree obtained from t 
under applying all possible transition rules from A over the tree 7(i). 
Let t € W T {X) be a term and A be a tree automaton which accepts t. In 
this case one says that A recognizes t or t is recognizable by A. The set of 
all by A recognizable terms is called tree-language recognized by A. 

Definition 5. Let t G W T (X) and let A be an automaton. A subtree 
t\p, p € Pos(t) is called essential for the pair (t, A) if there exist two assign- 
ments 71, 72 € Ass(X, To) such that .4.(71, i|p) / A(j2, and 71 (xj) = 
72 (xj) for all Xj £ X \ Var(t\p) with .4.(71, t) 7^ .4(72, i) i.e. .4 stops in 
different states when running over t with 71 and with 72. 

The set of all essential subtrees for the pair (t, A) is denoted by TEss(t, A). 
The subtrees from Sub(t) \ TEss(t,A) are called fictive for (t,A). 
A position p G Pos(t) is called essential for (t, A) if t\p € TEss(t, A). 
PEss(t, A) is the set of all essential positions of t. 

Analogously, the positions from Pos{t) \ PEss(t, A) are called fictive for 
(t,A). 

The set of essential input variables is denoted by Ess(t,A). It is studied in 

M- 

The essential variables for discrete functions are considered in [3j [7] . Essen- 
tial input variables and separable sets for terms in the concept of universal 
algebra are considered in [8]. 

Example 2. Let us consider the automaton defined as follows: 
A = {Q,!F,A, Qf), where Q = {qo,Qi}', f = f U f 1 U f 2 with F = 
{0, 1}, T\ = {g}, T 2 = {/1, / 2 }; Q/ = {^1} and A is defined by the equa- 
tions A (0) = q , Ao(l) = ?i; Ai(5,g ) = qi, &x{g,qi) = qo] A 2 (/i,gi,g 3 -) = 
g m , m = I if qi = qj = q\ and m = otherwise; A2(/2, (jj) = <Zz, i = 
if % = = go and I = 1 otherwise. 

Let us check whether the subtree t\21 = g{fx{x%, /i(#4, X3))) is essential 
for (t,A). Its set of independent subtrees is prefix determined by the po- 
sitions 1 and 22. The set of inputs of these subtrees is {27, #2}- All pos- 
sible assignments for our checking are sixteen (?). We will calculate the 
automaton's work only for two of these assignments 71 (27, X2, £3, £4) = 
(0, 0, 0, 1) and 72(27, x 2 , x%, X4) = (0, 0, 1, 1). We obtain the following equa- 
tions: .4.(71, *Itt) = -4(72,%) = Qo, .4(7i, ^221) = *4(72,t|22T) = qo and 
■4(71, *It) = -4(72, *It) = ?i > -4(71, t\&) = -4(72, fe) = ?i- Analogously we 
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Figure 2. 

have .4(71, 1 1 2H2) = go, 4(7i, 1 1 an) = g , 4(7i, %) = gi, and 4.(72, = 
gi, 4(72,t|2Tr) = 9l, 4(72,^21) = go- 
In the same way we obtain 
*4(7l,*ll) = 91, 4(7i, 1 1 0) = gi, and 
4(72,^2) = 91, 4(72, t|o) = 91- 

In the same way it can be checked that for all pairs of assignments with 
equal values on {x±,X2} the automaton will stop in the same states. Hence 
1 121" is not essential subtree for (t,A). 

Now, let us check whether t\ji is essential. Consider the following assign- 
ments: 73(2:1, X2, x 3 , X4) = (0, 1, 1, 0) and 74(^1, x 2 , x 3 , 24) = (1, 1, 1, 0). It is 
easy to see that 4(73, = go, and 4(74, = gi- 

So, we obtain 4(73, t) = gi, and 4(74, t) = go- Thus t\jj is essential for 
(*,4). 

Finally it can be checked that for each possible assignment 7 € Ass(X, Tq) 
it holds 4(7, t) = 4(7, t\ T ). 

This equation implies that the resulting states when 4 running over t are 
fully determined by the corresponding states when running over the tree 
t\j = g{f\{x\, X2)) with the graph given on Figure [2j 

It is clear that the runs of 4 over t\j can be calculate more quick than 
the runs over t. 

Theorem 1. The set PEss(t, 4) is prefix closed. 
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Proof. We have to prove that if t\p G TEss(t, A) then each subtree 
t\g with q ~i p is essential for (t, A). Let 71,72 G Ass(X, To) be two 
assignments, such that 71 (xj) = 72(xj), for Xj G X \ Var(t\p) with 

^(71,^)^-4(72,%) 

and 

A(j u t) ^ A{j 2 ,t). 
Let g be a prefix of p i.e. q <p. 

At first, suppose .4(71, t|g) / -4(72>i|g)- Then t\g G T£'ss(t,-4) because of 
Far(t| F ) C Far(ty. 
Secondly, let us suppose 

(1) Afrl, t\jj) =A(-f 2 ,t\g). 

From ^.(71, t\ ¥ ) ^ A(-y 2 ,t\p) it follows that Y = Var(t\^)\Var(t\ f ) ^ 0. Con- 
sider the assignments 7 G ^4ss(X\yar(t[p), J^o) and &,/3 £ ^4ss(yar(t|p), JF ) 
defined as follows 7(xj) = 7i(xi) = 72 (xj) for all ^ e 1\ yar(t|p), a(xj) = 
71 (xj) and /?(xj) = 72 (xj) for all Xj G Var(t\p). 

Clearly 7a = 71 and 7/? = 72. From ([1]) we have *4(a, i|q) = A{j3 : t\q). Hence 

Ainut) = Afra,t) = Afr,A(a,t\j)) = A(i,A(f3,t\q)) = A(rfP,t) = A(-y 2 ,t). 

This contradicts to t\p G TEss(t, A). B 

Corollary 1. [9] If Xj is essential input variable for (t, A) then there exists 
a strong chain Xj = ii < £2 < • • • <J ifc <! * such that Xj G Ess(tj,A) for 
j = l,. 

Corollary 2. If £|p is essential subtree for (t, A) and 71,72 G Ass(X, Fq) 
are as in Theorem Q] then A{^\,t\q) / -4(72, for each g, q ^p. 

Corollary 3. The set P = {p G Pos(t) | p ^ P£s,s(£, .4)} is prefix deter- 
mined w.r.t. Pos(t). 

Proof. Suppose that q G Pos(t), q ~<f, r G Pos(t) and <j ^ PEss(t,A). 
If we suppose that r G PEss(t, A) then we obtain g G PEss(t, A) which is 
a contradiction. B 
Two sets of subtrees Y and Z are called independent (Y ^ Z) if for each 
it G Y" and for each v € Z the inclusions Y C Ind(v) and Z C Ind(u) are 
held. 

Definition 6. Let t G W T (X) and -4 be an automaton. A set Y C 
TEss(t, A) is called separable for i and -4 w.r.t. a set 2 C TEss(t, A), 
Y ^ Z if there is an assignment 7 on U sG ^Far(s) \ U re yFar(r) such that 

rcm«(7(t),x). 

The set of all separable sets of subtrees for t and A w.r.t. Z will be denoted 
by TSep(t, A, Z). When Y is separable for t and A w.r.t. Z = IndiY) where 
Ind(Y) = U se y/nd(s) then the set Y is called separable for t and A and the 
set of such Y will be denoted by TSep(t,A). 
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When a set of essential subtrees is not separable, it will be called inseparable. 
It is not difficult to see that if V7 G Ass(X, Tq) 4(7,0 = 4(7, i) then 

TEss{t,A) = TEss(t',A). 

Theorem 2. If t' is an essential subtree of t and V7 G Ass(X, J-q) 

(2) A( 1 ,t')=A( 1 ,t) 

then each subtree s G Ind(t') with Far(s) \ yar(t') 7^ 0, is nonessential for 
(t,A). 

Proof. Suppose there is a subtree s of t with s G Ind{t') n TEss(t,A). 
Then there are two assignments 71,72 6 vlss(X, ^0) such that 4 (71, s) 7^ 
4(72, s), and 7i(xj) = 72(xj) for all Xj £ X \ Var(s) with 

(3) 4(71, *) 7^4( 7 2, t). 

Now, s G Ind(t r ) implies that 

(4) 4(71,0=4(72,0- 
On the other side by ([2]) and (j4]) we obtain 

4( 7 i, t) =4(71,0 =4(72,0 = 4(72, t) 
which contradicts to ([3|). B 

Theorem 3. If Y G TSep(t, A) then for every subtree s £ Y there exists 
at least one strong chain s = t\ <1 12 < ■ ■ ■ < ifc <3 i such that s G TEss(tj,A) 
for j = 1, . . . , fe. 

Proof. Let t = /(si, . . . , s n ). By s G TEss(t, A) it follows that there are 
two assignments 71,72 G Ass(X, ^b) such that 

4(7i, s) 7^ 4(72,s), and 7i(iCj) = 72(^3) for all Xj G Far(i|g), t\g G Ind(s) 
with 4(7i, t) # 4(72, *)■ 

At first, if Depth(t) — Depth(s) = 1 then the chain s < f is strong and the 
theorem is proved in this case. 

Secondly, let us assume Depth(t) — Depth(s) > 2. Suppose that s £ 
TEss(sj,A) for all j, j G {1, . . . , n}. This implies that 4 (71, Sj) = 4(72, Sj). 
1, . . . , n. Let us calculate again 4(71, t) and 4(72, t). So, we have 
4(7i, t) = A n (/,4(7i,si),. . .,4(71,%)) = A n (/,4(72,si), . . . ,4(72,%)) = 
4(72, i). This is a contradiction. Hence there exists a subterm Sj, j G 
{1, . . . , n} of i such that s G TEss(sj,A). Let us set = Sj. Clearly <\t 
is a strong chain. We can repeat this procedure for t/. instead of t and 
we shell get a subterm tk-\ of tk such that tk-i < tk < t is a strong chain 
and s G TEss(tk—i, 4). This process can be continued until we obtain a 
term ti with Depthfo) — Depth(s) = 1 and s G TEss(ti,A). Obviously 
if we set s = ti then s = ti < t2 < ■ ■ ■ < t^ < t is & strong chain with 
s €TEss(tj,A), j = l,...,k. ■ 
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